updating loading in stable lm demo to use transformer bridge #1023
jlarson4 merged 36 commits into dev-3.x-canary
…nsformer_bridge_migration
…demo_transformer_bridge_migration
…emo_transformer_bridge_migration
…s in HookedTransformer and TransformerLens
I ended up having to rewrite a significant portion of this notebook. I was unable to recreate the saved data on this version or on 2.x's HookedTransformer. I also attempted to recreate the output on […]

In July 2023, when this demo was created in #354: […]
In September 2023: IGNORE was changed to […]

The current behavior appears to be the correct behavior, and the original details of this demo were a side effect of the -1e5 buffer bug.
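The comment above does not spell out what the -1e5 buffer bug was. One plausible reading, offered here only as an assumption, is that masked attention scores were filled with a large finite value rather than negative infinity. The sketch below is plain Python rather than TransformerLens code, and `softmax`, `masked_softmax`, and the deliberately exaggerated scores are all hypothetical. It shows how a finite fill value can let a masked position absorb attention, while a negative-infinity fill cannot.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def masked_softmax(scores, allowed, fill):
    """Replace disallowed scores with `fill`, then apply softmax.

    `fill` stands in for a mask constant; this helper is hypothetical
    and is not the library's implementation.
    """
    filled = [s if ok else fill for s, ok in zip(scores, allowed)]
    return softmax(filled)


# Exaggerated scores: the two visible positions are far below -1e5.
# The third position is meant to be masked out.
scores = [-2.0e5, -2.0e5 - 1.0, 3.0]
allowed = [True, True, False]

finite = masked_softmax(scores, allowed, -1e5)
infinite = masked_softmax(scores, allowed, -math.inf)

print("finite fill:  ", finite)    # the masked position takes all the weight
print("infinite fill:", infinite)  # the masked position gets exactly zero
```

With the finite fill, the masked slot ends up as the largest score and dominates the distribution. With the negative-infinity fill, its weight is exactly zero regardless of how negative the visible scores are. Whether the real bug in this demo took this form is not confirmed by the comment.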
Additionally, I left this commented out of CI for now. The StableLM Alphas are too large to run within the constraints of our current CI infrastructure.